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Abstract 

We refine previous results to provide examples, and in some cases 
precise classifications, of extremal subsets of {1, ...,n} containing no 
solutions to a wide class of non- invariant, homogeneous linear equa- 
tions in three variables, i.e.: equations of the form ax + by — cz with 
a + b^c. 



1 Introduction 

A well-known problem in combinatorial number theory is that of locating 
extremal subsets of {1, ...,n} which contain no non-trivial solutions to a 
given linear equation 

C : a\x\ H h a n x n = b, (1) 

where oi, a n , b € Z and their GCD is one. Most of the best-known work 
concerns just three individual, homogeneous equations 

£i : x\ + x 2 = 2x 3 , 
£ 2 : x\ + x 2 = x 3 + a?4, 
£ 3 : x 1 +x 2 = x 3 , 

where the corresponding subsets are referred to, respectively, as sets without 
arithmetic progressions, Sidon sets and sum-free sets. The idea to consider 



arbitrary linear equations L was first enunciated explicitly in a pair of articles 
by Ruzsa in the mid-1990s [10] [11]. The only earlier reference of note 
would appear to be a paper of Lucht [6] concerning homogeneous equations 
in three variables, though Lucht's article was only concerned with subsets 
of N. Following Ruzsa, denote by r^in) the maximum size of a subset of 
{1, ...,n} which contains no non-trivial solutions to a given equation C (the 
precise meaning of the words 'non-trivial' is given in [10]). When considering 
arbitrary C, one begins by observing a basic distinction between those C 
which are translation-invariant <n = b = 0) and those which are not, 
namely : for the former it is always the case that rc(n) = o(n), a fact 
which follows easily from Szemeredi's famous theorem, whereas for the latter 
r c( n ) = Q(n) always. 

This paper is concerned with non-invariant, homogeneous equations only. 
For simplicity the words '(linear) equation' will, for the remainder of the ar- 
ticle, be assumed to refer to those equations with these extra properties, 
though some of our initial observations also apply in the inhomogeneous 
setting. We shall also employ the concise formulation 1 A avoids O to indi- 
cate that a set A of positive integers contains no solutions to the equation C. 
Finally we will employ the interval notation [a, ft] := {x G Z : a < x < /3}, 
and similarly for open intervals. 

As Ruzsa observed, given an equation C : J2 a i x i = 0) there are two ba- 
sic ways to exhibit the fact that rc{n) = fi(n) : 

I. Let s := J2 a i so s 0- Let q be any positive integer not dividing s and 
let A := {x G N : x = 1 (mod q)}. Then A avoids C and |An[l,n]| > \n/q\. 

II. Set 

s + := a 'h s- := Y N 

ai>0 (ii<0 

and assume without loss of generality that s+ > s_. For a fixed n > let 
A := (jj^n,n> ■ Then A avoids £ and has size Cl(n). 

As in [11], set Ao,£ := lim sup n ^ QO rc ^ . Ruzsa asked whether the above 
two constructions were the prototypes for extremal ^-avoiding sets in the 
sense that Ao = max{p, s +~ s - | ; where the quantity p is defined as follows : 
for each m > let p m • m be the maximum size of a subset of [1, m] which 
contains no solutions to C modulo m. Then p := sup m p m . 
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As illustrated by Schoen [12], the answer to Ruzsa's question is no. But 
from what is currently known, it seems that for many equations something 
not much more complicated holds. One observes that the construction II 
above can be modified into something more general : 

II'. For a given C : J2 a i x i = 0) let notation be as above and let a de- 
note the smallest absolute value of a negative coefficient Oj. Now for fixed 
k, n > and £ € [1, n] set 



fc-i 



U [£,n fe ], 



where ni,...,rik is any sequence of integers satisfying the recurrence 

g g 

n\ = n, — rik < £ < rifc, s+rij+i < a — rij + (s_ — a)£, j = 1, k — I. 
s + s + 

(2) 

Clearly for (2) to have any solution we will need to have k = O(logra). 
Assuming a solution exists, the set A n ^£ avoids C and 



L4 n ,^| = (l + (1)) 



fe-i 



rij + (n k - £ + 1) 



The important special case is when £ = 1 + L^j" n feJ • Then Ruzsa's question 
can be replaced by the following : 



Question Is it always the case that 

Ao = max < p, sup L ^ > , 

{ n,fc,c " J 

where the supremum ranges over all triples n,k,£ for which (2) has a solu- 
tion where £ = 1 + LfjjT^fcJ 

We will give examples in Section 3 which show that the answer to this 
question is still no : we are not aware of any in the existing literature. How- 
ever, existing results strongly suggest that the answer is very often yes : 
see [1] [2] [3] [8] [9] for example, plus further results in this paper. Also, in 
our counterexamples the extremal sets are a pretty obvious hybrid between 
the two alternatives which the question offers. We think that our question 
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is thus a good foundation for further research in this area. 

We now give a closer overwiew of the results in this paper. To identify 
extremal /^-avoiding sets and compute Ao,£ for arbitrary C seems a very 
daunting task, so an obvious strategy is to study equations in a fixed num- 
ber of variables. One variable is utterly trivial and two only slightly less 
so. I have not been able to locate the following statement anywhere in the 
literature, however (though see for example [5], pp. 30-34), so include it for 
completeness : 

Proposition Consider the equation C : ax = by where a > b and GCD(a, b) = 
1. For every n > an extremal C-avoiding subset of [l,n] is obtained by 
running through the numbers from 1 to n and choosing greedily. This yields 
the extremal subset 

A := {u ■ a 2% : i > and a f u} 

of N. For each n > a complete description of the extremal C-avoiding 
subsets of [l,n] is given as follows : 

Case 1:6=1. 

For each u £ [l,n] such that u f a, let a be the largest integer such that 
u ■ a a < n. Then an extremal set contains exactly [a/2] of the numbers 
u ■ a 1 , for < i < a, and no two numbers u ■ a 1 and u ■ a l+l . 

Case II : 6 > 1. 

For each u € [1, n] divisible by neither a nor b and each non-negative integer 
a such that u-a a < n, an extremal set contains exactly [a/2] of the numbers 
u-b l -a a ~ l , for < i < a, and no two numbers u-b l -a a ~ l and u-b t+1 -a a ~ l ~ l . 

Note that the proposition implies in particular that \$ = p for any equa- 
tion in two variables. For three variables things get more interesting and 
a number of papers have been entirely devoted to this situation, see [1] [2] 
[4] [6] [7] plus the multitude of papers on sum-free sets, of which the most 
directly relevant is probably [3]. The combined results of [1], [2] and [3] give, 
in principle, a complete classification of the extremal ^-avoiding subsets of 
[l,n], for every n > 0, and C : x + y = cz for any c / 2. Of particular 



4 



interest for us are the results of [1]. There it is shown that for every c > 4 
and n >> c 0, a set A n ^ of type II' is extremal, namely 

3 /2 
A «>3 := U [-nj,nj 

where 

, (i + L§"jJ) + (i + L§"3j) , . 19 

m = n, n j+ i = L J, J = 1,2. 

Moreover it is shown that, for all n >> c 0, there are only a bounded number 
of extremal sets, all of whose symmetric differences with A n ^ consist of a 
bounded number of elements (both bounds are independent of both n and 
c). 

These results were partly extended in [4]. Here the authors considered 
equations C : ax + by = cz in two families : 

Family I : a = 1 < b. 

Family II : a = b, GCD(6, c) = 1. 

For Family I equations their main result is that, when c > 2(6 + l) 2 — 
GCD(6 + l,c) then sets A Ut 2 of type II' consisting of exactly 2 intervals 
are extremal ^-avoiding sets in [l,n] up to an error of at most O(logn) for 
every n. In particular, these sets give the right value of Ao- They do not 
attempt any classification of the extremal sets, however. They also note 
that, whenever c > (b + 1) 3//2 , the same sets are of maximum size among 
all type II' sets, and conjecture that they are still extremal, up to the same 
O(logn) error. 

For Family II equations they simply note that when c > (26) 3 / 2 , then 
among all type II' sets the largest consist of three intervals. They do not 
discuss whether such sets are extremal or not. 

Our results concern the same two families of equations. For Family I we 
employ the methods of [1] to obtain a classification (Theorem 2.5) of the 
extremal C- avoiding sets whenever 

« > 3££. ,3) 

We show that, for every n »b,c the sets A n ^ are actually extremal and 
there are only a bounded number of possibilities for the extremal subsets of 
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[l,n], all of which have a symmetric difference with A n p of bounded size. 
Both bounds are independent of n, b and c. 

We show by means of an example that the lower bound (3) on c can- 
not be significantly improved, which also disproves the conjecture of Dilcher 
and Lucht. Namely we show that when c = b 2 another type of /^-avoiding 
subset of [1, n] is larger than A n ^ by a factor of f2(n). In some cases we can 
prove that these other sets are in fact extremal and conjecture that this is 
generally the case (conjecture 2.7). 

For Family II equations we describe extremal sets in [l,n] for all n, and 
for every b, c with b > 1 (Theorem 3.1). Their appearance takes three dif- 
ferent forms, for values of c in the following three ranges : (i) c > 2b, (ii) 
2 < c < 26, (iii) c = 1. In contrast to when b = 1, it is not the case for 
c » b that the extremal sets consist of three intervals. Rather they are a 
hybrid between the two alternatives offered by our earlier Question. 

2 Results for Family I equations 

The methods of this section follow very closely those of [1], so we will not 
include full proofs of all results. Nevertheless, several technical difficulties 
arise and considerable care is needed to dispose of them. Thus we will give 
a fair amount of detail anyway, even if the resulting computations become 
somewhat long-winded. Let C be a fixed equation of the form x + by = cz 
such that b > 1 and (3) holds. We prove analogues of Lemmas 2,3,4 and 
Theorems 1,2 in [1]. First a definition corresponding to Definition 1 of that 
paper : 

Definition 1 : Let n G N and A C [l,n] be ^-avoiding with smallest 
element s := sa- Define sequences (r^), (Zj), (Ai) by 



Let t denote the least integer such that r t+1 < s. Observe that for all i >t, 



A := A, n := n, 





A: 



(Ai-i\(r i+1 ,li\) U [li,ri] n [s,n], for i > 1. 




(4) 
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where a = a a '■= max{l ( + 1, s}. 



It is easy to see that, by construction, each set in the sequence (Ai) is 
/^-avoiding provided Aq is, and that At is then an /^-avoiding set of type II' 
in the introduction. The crucial observation is the direct generalisation of 
Lemma 2(b) in [1] : because of its importance and because an apparently 
awkward technicality arises in dealing with one of the cases {Case /below), 
we will provide a complete proof. 

Lemma 2.1 Let n > and A := Aq C [l,n] be C-avoiding. Then \Ai\ > 
l-^i-il f or every i > 0. 

PROOF : Following the same reasoning as in [1], it suffices to prove the 
claim for i = 1, and thus to prove that, for every n > and every ^-avoiding 
subset of [1, n], we have 



where 



|A|<|An[l,r 2iA ]| + r(l-^)nl, 



L^inJ + bs A 
r 2 ,A ■= I— J. 



(5) 



The proof is by induction on n, the case n = 1 being trivial. So suppose the 
result holds for 1 < m < n and let A be an ^-avoiding subset of [1, n). The 
result is again trivial if sa > i^ir) n > so we may assume that sa < n 



and thus that 



T2,A < 



(m.)n + bs A /6+l\ 2 



< 



n < 



n 



because of (3). 

First suppose that there exists z G A n (j^, 
denote A c := [l,n]\A. 



(6+l)n 

c 



To simplify notation, 



Case I : z < bn/c. 

In this case we will show independently of the induction hypothesis that 
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something stronger than (5) holds, namely that 



\a\ < r(i 



6+1 



n 



■1- 



(6) 



We have cz G (n, 6n]. Set cz := t. Now A contains no solutions to the 
equation 



x + by = t. 



(7) 



Hence for every y E [^p-, ^] at most one of the numbers y and t — by lies 
in A. Now t — by = t (mod b) for every y. In this way we can locate in A c 
at least as many numbers as there are numbers in the interval [^-jp, ^-jp] not 
congruent to i (mod 6). Define two parameters u,v £ [1, 6] as follows : 
(i) u = £ (mod b), 



(ii) the total number of integers in the interval 
to v (mod 6). 



t-n t-1 
b ' b 



is congruent 



Then one readily checks that the number of integers in 
gruent to t (mod b) is at least 



t-n t-1 
b ' b 



not con- 



1 



n — (u — 1) 6 — w 



n 



6-l\ (6-l)(«-l)+6(6-u) 



6 2 



6 2 



and is at least one more than this when v < b unless one of the first v 



numbers in the interval 



t-n t-1 
b ' b 



is congruent to t (mod b). Set 



f(n,b,c,u,v) :-- 



(b - l)(u - 1) + b(b - v) 
b 2 



n 



6-1 6+1 



Note that (3) implies that f(n,b,c,u,v) < 2 but, for (6) to be already 
satisfied we would need f(n,b,c,u,v) < 1. This is where things get messy. 
Note that certainly f(n,b,c,u,v) < 1 unless perhaps n>3, v < u — 2 
and one of the first v numbers in the interval , ^ is congruent to 
u (mod 6). The first assumption implies in particular that 6 > 3. All three 
together imply that the numbers 1 and 2 both lie to the left of the interval 
, , and neither is congruent to t (mod 6) . Thus neither can have 
been already located in A c via the pairing arising from (7). Since it suffices 
at this point to locate just one extra element in A c we may for the remainder 
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of this argument assume that b > 3 and that 1,2 G A. The latter implies 
that there are no solutions in A to either of the equations 

x + by = c, (8) 
x + by = 2c. (9) 

To continue the argument we go back to (7). To locate elements in A c we 
paired off numbers in not congruent to t (mod b) with numbers 

in [l,n] congruent to i (mod 6). It would thus suffice if we could also pair 
off at least one further number in the former interval, whch we call X. It is 
easy to see that this can definitely be done if X contains a total of at least 
b 2 numbers. Hence we may further assume now that 

|X| < b 2 , (10) 

and hence that n < 6 3 , though we will make no explicit use of this latter 
fact. 

From (10) we want to conclude that either n < c or, for an appropriate 
choice of the original z, that c = t = (mod b). So suppose n > c. First set 

xi := 1, x 2 :=b + l, y 1 ■= c - b, y 2 := c - b(b + 1). 

Since A contains no solutions to (8), at most one of x« and yi is in A for each 
i = 1,2. Thus yi G ^4 C since n > c and we already know that xi <G A. From 
(3) it follows that x 2 < y 2 and also from (10) that y\ — y 2 > |X|, so that at 
least one of y\ and y 2 must lie outside X. Furthermore, since u > 3, neither 
of the Xj is congruent to t (mod 6). If b + 1 X it is thus already clear that, 
unless c = t (mod b) , we can find amongst x 2 ,y\, y 2 at least one element of 
A c not previously located via (7). But similarly, if b + 1 G X then one easily 
checks that (3) implies that y\ X and so we have the same conclusion. 

Thus we are done if n > c unless c = t (mod b). To get that c = 
(mod b) it would then suffice to also show that 2c = t (mod b). If n > 2c— b 
then this is immediately achieved by a similar argument to the one just given, 
but this time using (9) instead of (8). If n < 2c — b then we just have to 
note that we could have from the beginning chosen z := 2, in which case 
2c = t, by definition. 

Thus (6) holds unless either n < c or c = (mod b). By (3) the latter 
would imply that c > b 2 + kb where we can take k = 3 when b > 3 and k = 4 
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when 6 = 3. Then 



. , (6-l)(u-l)+6(6-t;) fb-1 b + V 
f{n,b,c,u,v) = -= n 



b 2 V b 2 



c 



(6 - 1)(26 - 1) (b-\ 6 + 1 
< tt. n 



b 2 V b 2 b 2 + kbj ' 

We'll be done unless f(n,b,c,u,v) > 1. One checks that this already forces 
n < c when 6 = 3, and that for 6 > 3 it yields (taking k = 3) that 

n< 6 2 + 36 + l + - <c+l + 



6-3 ~ 6-3 
This in turn yields that 

'6+1 



nj < 6 + 1 (11) 



except when 6 = 4, c = 28 and n £ {34,35}. One easily checks that (6) 
holds in these exceptional cases, so we're left with (11). First suppose n > c 
so that |_(^r) n \ = b + 1, and consider (8). For 1 < i < b + 1 set yi := i 
and Xi := c — ib. Then at least one of X{ and t)i is in A c for each i. But (3) 
implies that Xb+i < Vb+ii hence |yl c | > 6 + 1 which proves (6). 

We are thus indeed left with the case when n < c. Now we could have 
chosen z := 1 initially and thus paired off numbers in T\ := ^f 1 , 
congruent to c (mod 6) with numbers in [l,n] congruent to c (mod b] 
usual, it suffices to locate at least one further element in A c . First suppose 

2c - 1 

— 7— <n. (12) 



not 
As 



2c-n 2c- 1 



not 



Then similarly, by (9), we can pair off numbers in I2 :— — 5—, — b - 
congruent to 2c (mod 6) with numbers in [l,n] congruent to 2c (mod 6). 
The crucial point is that, since n < c, the intervals X\ and X2 are disjoint. 
Each interval certainly contains at least three elements by (12). It is then 
easy to see that the ^-pairing will certainly locate at least one more element 
in A c unless, at the very least, 2c = c = (mod 6). But in that case the 
map 0:yi-^y + |isa bijection from X\ to X2 so that if the Xi-pairing pairs 
y with x, say, then the ^-pairing pairs (f)(y) with x. If we now choose y as 
the smallest multiple of 6 in 2Ti, then we see that one of the two pairings 
must locate the desired extra element in A c , unless perhaps f = (mod 6) 
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also. But then c = (mod b 2 ) and thus c > 2b 2 if b > 3 and c > 36 2 if 6 = 3. 
But then, calculating as before, we'll have f(n,b,c,u,v) < 1 unless perhaps 

^ 2(f) 2 -36+1) , J ty 

n < K 6 _ 3 — when o > 3, 
n < 3 %-2? ] ' when 6 = 3. 

But these inequalities contradict (12). Now we are only left with the pos- 
sibility that n < ^p-, hence that |_(^7r) n \ e {1>2}- But in each case one 
may check that one can locate one or two elements of A c as approppriate, by 
considering solutions of (8) with x and y close to This finally completes 
the analysis of Case I. 

Case II : z > bn/c. 

Then cz G (bn, (b + l)n]. Put cz := t again. Let t = (b + l)n — s where 
< s < n. If x + by = t for some integers x, y G [1, n], then y > n — s/b and 
x >n — s. Since A avoids C we thus find, for every integer y £ A n [n — f , n] 
not congruent to £ (mod 6), an integer x £ ^4 C n [n — s,n], congruent to 
t (mod 6). Noting in addition that at least one of n and n — s is not in A, 
one readily verifies that hence 

\An[n-s,n}\< (i_^i)( s + l). (13) 

We now apply the induction hypothesis. Let B := A H [1, n — s — 1]. If 5 is 
empty then (13) and (3) immediately imply (5). Otherwise clearly sb = sa 
and T2,b < r 2,A-, so the induction hyothesis gives that 

\A\ < L4 n [1, r 2 , A }\ + f(l - ^) (n - a - 1)1 + (l - ^) (a + 1), 
from which (5) follows by another application of (3). 

We have thus completed the induction step under the assumption that 
^ ^ ( c' ^ b+ p n 7^ 0, so we can now assume the intersection is empty. Sup- 
pose z G A n (r2,A,n/c]. Then L^r" n J + ^ S A < cz < n and cz — bsA G A c . 
In other words, we can pair off elements in A n (r2,A, f ] with elements in 
(^r-n, n (1 A c . This immediately implies (5) and completes the proof of 
Lemma 2.1. 
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Lemma 2.2 Let A be an C-avoiding subset of [l,n] of maximum size. Let 
s = sa and t := max{i G N : n > s}. If n »b, c i/ien t = 2. 

Proof : Just follow the reasoning in the proof of Lemma 3 in [1]. By 
Lemma 2.1, it suffices to know that there exists an absolute positive con- 



is such that, in the notation of eq.(4), \A 2 \ = D(b, c)-n+0(l) when s = Z2+I- 
The core of a proof that such a constant exists is contained in the proof of 
Lemma 1 in [4], though one has to be a little careful since there only sets 
At in which s = It + 1 are considered. However one can tediously check that 
allowing for arbitrary s G (It, ft] will not change matters (I note that the 
authors of [4] needed this fact in Section 3 of their paper, though they do 
not seem to explicitly mention it anywhere). 

Lemma 2.3 Let n >>b, c 0. If A is an C-avoiding subset of [l,n] of 
maximum size then there exists an absolute positive constant n\ c such that 



stant such that, if t ^ 2 then 



where 



D(b,c) : = 



(c-b-l)(c 2 -b 2 + l) 
c[c 2 -b(b+l)] 



(14) 




PROOF : The proof follows that of Lemma 4 in [1]. We set 



s' := min{s € [1, n] : fa(s) < s}. 



A computation similar to that in the Appendix of [1] yields that 




where ei,e2 € [0, 1). By (3) it follows that 



s' G [5,5+1]. 



(15) 
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Now we have 

\M*) 



[fl-^U] +r 2 (a)-s + l, ifs>s', 
[(1- *±Mn] +r 2 (5)-Z 2 (a), if s < s'. 



First suppose s > s'. We will certainly have \A 2 (s' + 2)| < |^4 2 (s)| because 
of (3) and since r 2 (s) can only increase at most once in |_§J times. This 
proves that sa < S + 2 for a maximum /^-avoiding A. Secondly, if s < s' 
then |A 2 (s)| will decrease once r2(s) — l 2 (s) decreases, and clearly this will 
happen after O(^) steps. This proves that S — k\ c < sa for a maximum A 
and some Kj, c = S~2(^). 

Theorem 2.4 rc(n) = D(b,c) - n + 0(l), where D(b,c) is given by (14)- In 
particular, Ao,£ = D(b,c). 

Proof : See Lemma 1 in [4] and Theorem 1 in [1]. Note that the sec- 
ond statement is the same as in Theorem 1 of [4], just with a better lower 
bound for c, namely (3). 

We can now present the main classification result, analogous to Theorem 
2 in [1]. In fact the result here is in some sense even cleaner, as the max- 
imum ^-avoiding sets consist essentially of two rather than three intervals 
and there is thus even less possibility for variation. 

Theorem 2.5 Let b, c G N with b > 1 and c satisfying (3). Let C be 
the equation x + by = cz. Let n > 0. Define S and s' as in Lemma 2.3. 
Let A be an C-avoiding subset of [l,n] of maximum size and with smallest 
element s = sa- If n »b, c then the following holds : s G [S,S + 2] and 
A = 1 2 U X\ where 

T j {[s,r 2 ], [s,r 2 + 1]}, ifs>s', . . 

X2fc \ {[s,r 2 ),[s,r 2 ]\{r 2 -^}}, tfs<s>, (W) 

for some £i G [1, &], and 



(Zi , n] , i/r 2 + l^i and Z x A, 

[/i,n]\{n - 6} , if r 2 + l £ A andh £ A, 

[h,n] \{h + &}, if r 2 + 1 e A and l\ A, 

[/i,n]\{Zi + ^4,^ - £5} , ifr 2 + leA and l\ G A, 



(17) 
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for some £2 £ [0, n], £3 <G [l,n], £4 G [1,6 — 1], £5 € [0, n]. Moreover, 
in all cases where they arise, the parameters £j, i = 1, ...,5, are uniquely 
determined by n and s according to the following relations : 

d 2 = (&+l)r 2 -£i, (18) 
rfi = (6+l)n-Ci, ^G{2,5}, (19) 
c(r 2 + l) = bs + (h+ti), i€{3,4}. (20) 



Remark : As is the case in [1], Theorem 2.5 does not precsiely deter- 
mine the maximum /^-avoiding subsets of [l,ra] for every n >>b,c 0. For 
any particular n, some of the possibilities listed for A may either not be 
^-avoiding or not have maximum size. But the important point is that we 
have a bounded number of possibilities for any n, and the symmetric dif- 
ference between any two of these possibilities is also bounded in size, both 
bounds being independent of n, b and c. Since the periodicity phenomenon 
described in Section 4 of [1] easily generalises to the present setting, The- 
orem 2.5 thus reduces the precise classification of the extremal ^-avoiding 
sets to a finite computation for any given C. 

PROOF of Theorem 2.5 : Again we follow the approach in [1]. On the one 
hand, since t = 2 here, there are fewer steps in the analysis. On the other 
hand, we will need a somewhat modified argument in one of the steps. We 
shall thus present the full argument quite carefully, though not in every sin- 
gle detail. We will need something analogous to Lemma 1 of [1]. Let zeN. 
Then x + by = cz will have solutions where x and y are close to . Indeed 

if 1 — e := Lf^iJ t nen we nave a solution (xo,yo) = {j^i — be, + e ) • 

For each i > define the solution 

( x » Vi) ■= (j^T[ ~ b (* + £ )' bTi + * + e ) • 

Now for any z G N and d > let 1% denote the interval [xj, yd]- Then the 
analogue of Lemma 1 we need is 

Lemma 2.6 Let A C [l,n] be C-avoiding and z £ A. Then for any d > 0, 

|/ z V|>d + i. 
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Let A be a maximum /^-avoiding subset of [l,n]. By Lemma 2.1 we know 
that \A\ = \Ai \ = \A 2 \ and by Lemma 2.2 that r% < s, when n is sufficiently 
large. The proof of Theorem 2.5 is accomplished in three steps. First, by 
comparing A^ with A\ we show that A contains almost the whole interval 
(72, ^2]- Here the argument entirely parallells that in [1]. In the second step 
we deduce that fa, h] (~l A is almost empty. Here we use Lemma 2.6, but 
in a somewhat different way than in [1], as we will instead use an approach 
similar to that in the proof of Lemma 2.1. The final step is to compare A 
with A\ to show that A contains almost all of (li,n\. 

Step 1 : If s > I2 then clearly [s,r2] Q A. Lemma 2.2 and (15) give 
s £ [5,5 + 2]. Suppose s < £2- We want to show that s = £2- Suppose 
on the contrary that B := [s,l2) is non-empty. Put 

C:=IIU [J 1°. 

beB\{s} 

It is clear that C C f^,^] for all n >>b, c 0. The crucial point is that (3) 
guarantees that all the intervals making up C are pairwise disjoint. Thus 
Lemma 2.6 implies that \C\A\ > \B\, which contradicts the maximality of 
A. Thus s = I2, which implies on the one hand (computation required, using 
(3)) that s > 5, and on the other that 

|An[ S ,r 2 ]| = |(s,r 2 ]|. (21) 

If T2 A we infer that A n [s,^] = [s,^). If r2 G A then cs — br2 = 
d.2 — br2 A, so — c + 1 < c/2 — (b + l)r2 < — 1 and hence 

-c+ 1 c 

< - 7 7 h ~ r 2 < 0. 



6 + 1 ~ 6 + 1 

But if ^-^2 — ^2 < —1 then 1^ C (^2, r 2] (f° r ^ >>6,c 0), which would 
contradict (21). Thus — r 2 G ( — 1 5 0), which confirms that A n [s, r2] = 
[ s > r 2]\{+2 — Ci} f° r the unique £1 £ [1,6] satisfying (18). We also deduce 
that r2 + 1 A since c/2 — 6(r2 + 1) < r2 — 6 and thus lies in A. 

This completes Step 1 and shows that A must contain a set J2 which is 
one of the possibilities given by (16). 

Step 2 : We will show that if n »b, c then A n [r 2 + 2, Zi) = 0. We 
have 

|A| = |A 1 | = |(^\(r 2 ,/ 1 ])U(/ 1 ,n]|. 
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So the idea is to show that if A n [v2 + 2,l\) were non-empty, then it would 
have to have smaller cardinality than (li,n]\A. Since b > 1 the direct 
analogue of the argument in [1] will not work. Instead we gain inspiration 
from the proof of Lemma 2.1. First suppose there exists z G An (j^, ^- 
As in the proof of Lemma 2.1, this implies that 



\ A \< { l -—) n + 2 > 

and thus, by Theorem 2.4, A can't possibly be maximum /^-avoiding for 
n >>5 )C 0. Next suppose there exists z G An (^-, ( b+ ^ n . Again, as in the 
proof of Lemma 2.1, this will gives us a a G [0,n] such that 

\An[n-a,n}\ < (l - b -^j (a + 1). 

Clearly then, by Theorem 2.4, there exists a constant k? > such that A 



cannot possibly be maximum /^-avoiding if a > k? . Thus there exists a 



7>,c 

corresponding k\ c > such that we can now deduce that A n l\ — k\ c 
is empty. 

Let U\ := An [r2 + 2,n/c] and U2 := A n (/1 — Atf c , Zi). It remains to 
show that Z^i and U2 are empty, so let us suppose otherwise. 

For z £ £4, let C z := {cz — bs, cz — 6(s + l)}. Also let C r2+ i := {c(r2 + l) — 
6s}. Then C z nA = (f) for any z G in[r 2 +l,n/c]. Clearly, if n >>b, c then 
C z C (Zi,n — f2(n)]. Also (3) guarantees that the C z are pairwise disjoint. 

For z G let T> z := l\ n A c \i z is the smallest element of Z^2 and 

:= I z n A c otherwise. Again (3) guarantees that the T> z are pairwise 
disjoint. Clearly there exists a constant k\ c > such that all the V z are 
contained in [n — K,f c , n\. Thus the V z are also disjoint from the C z . 

In summary, we can thus conclude that, for n sufficiently large, 

\(h,n] n A c \ >8 ± + 2|Wi| + (|M 2 + 1 - 5 2 ), 

where S± = 1 if r2 + 1 G A and zero otherwise, and 62 = 1 if U2 = and 
zero otherwise. It follows immediately that \A\ < \A\ \ unless U\ and U2 are 
both empty. This completes Step 2. 

Step 3 : It just remains to show that the possibilities for A n [h,n] are 
as given by (17). By Steps 1 and 2 we only have four cases to consider, 
according to A n {r2 + 1, h}. To verify the various possibilities for 1\ one 
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considers the numbers c{r 2 + 1) — bs or cl\ — (6 + l)n as appropriate, the 
analysis being similar to that in the latter part of Step 1 above. We omit 
the details and consider the proof of Theorem 2.5 as complete. 



We close this section by showing that, in general, the bound (3) cannot 
be significantly decreased without the extremal sets avoiding x + by = cz 
looking quite different than those described in the above theorem. In par- 
ticular we have a counterexample to the conjecture in [4] that a bound of 
c > (b + l) 3 / 2 should suffice. For a counterexample we set c = b 2 . Let £ b 
denote the equation x + by = b 2 z where b > 1. The constructions considered 
earlier in this section yield that 

A >n(hh\ (b 2 -b-l)(b±-b 2 + l) 
A , A > D(b,b) = b2[b ,_ b{b + 1)] • 

But for every b > 1 the true value of Xo,c b is larger. For let 

A b : = {u • 6 3i : u > 0, i > and b t u}. 
Then clearly Af, is an /^-avoiding subset of N and 

UA , b 2 {b 2 -b- 1)(6 4 -b 2 + 1) wi 

We conjecture the following : 



Conjecture 2.7 For every n > and every b > 2 the set A b n [l,n] is an 
Cb-avoiding subset of [l,n] o/ maximum size. In particular Xo c b = Pc b = 
b 2 

b 2 +b+i ■ 

We suspect in fact that for n >>b any extremal /^-avoiding subset of 
[l,n] must be very similar to A^ n [l,n]. Frustratingly we have not been 
able to verify any of these assertions in general, not even the value of Ao,£ 6 • 
We do have proofs of Conjecture 2.7 for b = 2,3 which we now present. 
They employ the same idea, but things get pretty messy for b = 3 and we 
don't see how to make the same idea work for larger b. 

Theorem 2.8 Conjecture 2.7 holds for 6 = 2 and 6 = 3. 

Proof for 6 = 2: Fix n > 0. Put A = A 2 n [l,n]. Let B be an £ 2 - 
avoiding subset of [l,n]. We must show that | B | < | A |. We will do this 
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by exhibiting a one-to-one function 

/ : B\A - 

For k = 1,2 let 

B fc :=Sn{n-2 3i+fc : 2fn,i > 0}. 

Then 5\A = Bi U B2. For x G we shall define /(x) according as to 

whether x G B\ or x G i?2- 

First suppose x £ B\. Then x = 2y for some y G A But y B since 
i? avoids £2 and 

4-y = 2-y+(2y). 
So in this case we define f(x) = y. 

Next suppose x G B2. Then x = Ay for some y £ A. Then 3y G i, 
but 3y ^ B since .B avoids £2 and 

4(3y) =2(4y) + (4y). 

So in this case we define /(x) = 3y. 

It is clear that the restrictions of / to both B\ and B2 are one-to-one. 
So it remains to show that f(B±) n /(-B2) = 0- 

So let y,z £ A and suppose that f(2y) = /(4z). Thus y = 3z and so 
2y = 6z. So both 4z G -B and 62; G B. But this is a contradiction, since B 
avoids £2 and 

4(4z) = 2(6z) + (4z). 

Proof for 6 = 3: Fix n > 0. Put A = A 3 n [l,n] and let B be an £3- 
avoiding subset of [l,n]. As before we will describe an explicit one-to-one 
function / : B\A — > We define the sets -Bi and 1?2 in an analogous 

manner to above and for x G B\A will define f(x) according as to whether 
x G B\ or i?2- This time, both the definition of / and the proof that it 
is one-to-one will be somewhat more complicated than before, so we divide 
this process into three clear steps. 
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Step 1 : We define / on B\ and show that f \b 1 is one-to-one. 

Let x G B\. Then x = 3y for some y G A. Then 2y G A. Now it can't be 
the case that both y and 2y lie in i?, since B avoids £3 and 

9-y = 3(2y) + (3y). 

Hence we define 



/(*) 



y, tfy^B, 

2y, otherwise. 



We need to show that / is on-to-one on B±. Suppose otherwise. Then there 
is an x = 3y G B\ such that 2x = 6y G B\ and /(x) = f(2x) = 2y. But this 
implies that y £ B, which is a contradiction, since B avoids £3 and 

9-y = 3-y + {6y). 

Step 2 : We define / on B2 in such a way that 

f(B 1 )nf(B 2 )=cf>. 

Let x G -E>2- Then x = 9y for some ye A. Now 4y G A, but 4y ^ B since £> 
avoids £3 and 

9(%) = 3(9y) + (9y). 

If % G f{B±) then either 



(a) 12y G -B, in which case f(12y) = 4y, or 

(b) 12y B, but 6y £ B and /(6y) = 4y. In this case, the definition of 
/ on B\ implies that 2y G B. I also claim that in this case, y £ B U f(Bi). 
Suppose y G B. Then, since 6y G B, the equation 9 • y = 3 • y + (6y) 
contradicts the fact that .B avoids £3. So suppose y G f(B±). Then either 
y = /(3y) or y = f(3y/2). But if 3y G -B then, since both 6y and 9y are also 
in B, the equation 9(3y) = 3(6y) + (9y) contradicts the fact that B avoids 
£ 3 . And if 3y/2 G 5 then the equation 9(^) = 3(^) + (9y) likewise gives 
a contradiction. 



Hence, we begin by defining 




4y, if 12j, 5 and {2y, 6y} % B, 
y, if 12y 5 and {2y, 6y} C B. 
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It remains to define fix) when \2y G B. Notice that then 2y g" B, since 
9(2y) = 3(2y) + (12y). If 2y € /(Bi) then either 2y = /(3y) or 2y = f(6y). 
So we define 

f(x) = 2y, if 12y G B, 3y B and 6y B. 

Next suppose 3y B but 6y G B. Then y B since 9 • y = 3 • y + (6y). And 
y G" /(Bi) either, since if it were then either y = /(3y) or y = /(3y/2). But 
3y B, by assumption, and 3y/2 B since 9(^) = 3(|0 + (9y). 
Thus we may define 

f(x) = y, if 12y G B, 3y g B and 6y G B. 

Finally, it remains to define / when both 12y and 3y are in B. I claim that 
in this case, 8y g B U f{B{). Suppose 8y G B. Then 9(3y) = 3(8y) + (3y), 
contradicting B's avoidance of £3. Suppose 8y G f{B\). Then either 8y = 
/(12y) or 8y = /(24y). But /(12y) = 4y, since 4y G" B, by the definition of 
/ on B\. Otherwise 24y G B, in which case 9(9y) = 3(24y) + (9y), provoking 
another contradiction. 
Thus we may define 

f(x) = 8y, if I2y G B and 3y G B. 

This completes the definition of / on B2, and it is automatic that f(B\) n 
/(B 2 ) = 0. 

S'tep 5 : We show that f \b 2 1S one-to-one. 

So suppose that there are y, z G A with y ^ z but f(9y) = f(9z). Without 
loss of generality, f(9y)/9y < f(9z)/9z. We then have nine cases to consider. 

Case I : /(9y) = y, f{9z) = 2z and I2y B. 

Then y = 2z and {2y, 6y} C B. But 2y = 4z and then the equation 
9(4z) = 3(9z) + (9z) contradicts B's avoidance of £3. 

Case II : f(9y) = y, f(9z) = 2z and 12y G B. 

Then 12y = 24z and the equation 9(9z) = 3(24z) + (9z) contradicts B's 
avoidance of £3. 
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Case III : f(9y) = y, f{9z) = Az and 12y B. 

Then 6y = 24z G B and we get the same contradiction as in Case II. 
Case IV : f(9y) = y, f(9z) = 4z and 12y G B. 

Then we still have that 6y = 24z G B, so we get the same contradiction 
as in Case II. 

Case V : f(9y) = y, f{9z) = 8z and 12y B. 

Then I2z = 3y/2 £ B and the equation 9(^) = 3(^) + (9y) yields a 
contradiction. 

Case VI : f(9y) = y, f(9z) = 8z and 12y € B. 

Then we still have that 12z = 3y/2 G B, so we get the same contradic- 
tion as in Case V. 

Case VII : f(9y) = 2y and f(9z) = 4z. 

Then y = 2z and 12y = 24z G B, so we get the same contradiction as 
in Case II. 

Case VIII : f(9y) = 2y and f(9z) = 8z. 

Then 12z = 3y G B, contradicting the definition of / and the fact that 
/(9y) = 2y. 

Case IX : /(9y) = 4y and /(9z) = 8z. 

Then y = 2z and 3z = 3y/2 G so we get the same contradiction as 
in Case V. 

We have now completed Steps 1,2 and 3, and with that the proof of 
Theorem 2.8. 
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3 Results for Family II equations 



In this section, C denotes an equation b(x + y) = cz, where 6 and c are 
positive integers such that b > 1 and GCD(6, c) = 1. These equations were 
briefly touched on in [4], and the case 6=1 was studied in detail in [1]. We 
present a theorem which describes extremal ^-avoiding subsets of [l,n] for 
all values of b, c and n. The most interesting part of the theorem is part (i) 
which shows that the situation when c >> b is quite different from when 
6 = 1, since the extremal sets we describe are a 'hybrid' between the two 
possibilities predicted by the Question in the introduction. 



Theorem 3.1 (%) If c > 26 then, for every n > 0, the set 



A r , 



(2bn 



n 



u ie 



26n 



: x ^ (mod 6) 



is an C-avoiding subset of [l,n] of maximum size, 
(ii) If 2 < c < 26 then, for every n > 0, the set 

A' n = {x G [1, n] : x ^ (mod 6)} 

is an C-avoiding subset of [l,n] of maximum size. 
(Hi) If a = 1 then, for every n > 0, the set 



n 

26' 



n 



is an C-avoiding subset of [l,n] of maximum size. 
Note that 



\Ar 



n 



L 



2n 

c " 
n 



\A' n \ = n-[-\, 



\K\=n- L £j. 



(22) 
(23) 
(24) 



PROOF of part (i) : We fix c > 26 and proceed by induction on n. The 
theorem obviously holds if n = 1. Fix n > 1 and let i? be any ^-avoiding 
subset of [l,n]. We must show that \B\ < \A n \. 
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First suppose there exists a number z G B n (^r, -^r] which is a multi- 
ple of b. Let 2 = bz\. Then t := cz\ G (n,2n] and, since B avoids C, there 
are no solutions in B to the equation 

x + y = t. 

Now the map f : x 1—* t — x is a 1-1 mapping from the interval / := [t — n, n] 
to itself, and for each x G I, at most one of x and /(x) lies in B. Define 
s G [1, n] so that t — n = n — s + 1. Then we conclude that 

|J3n[n-s + l,n]| < L«/2J . 

If s = n then |B| < |_§J < |^4n|, by (30). Otherwise, the induction hypothesis 
yields that 

\B\ < \A n _ s \ + L|j 
. , . 2(n - s) . s 

< IAJ + 1, 

and since |5| is an integer, we conclude that |5| < |A n |, as desired. 

Thus we may assume that B contains no multiples of b in the interval 
(lT> ^cP] - ^ ^ contains no multiples of 6 at all in the range [1, then 
trivially \B\ < \A n \. So let's assume B does contain such a number, and let 
the largest such be zq. Thus zq G [1, ^]. Let = Then cxi G [c, n] 
and there are no solutions in B to 

x + y = czi. (25) 

Note that since c > 26, we have cz\ > 2zq, so if (x,y) is a solution to (33), 
then x < zq y > zq. 

Suppose cz\ = j (mod b). Then for every number x G [l,£o] such that 
x ^ j (mod b), at most one of x and c^i — x lies in B. But note that, for 
every such x, cz\ — x is not divisible by 6, and is strictly greater than zq. 
We conclude that at least z — z\ > z\ numbers are missing from B which 
are either multiples of b in [l,zo] or n °t divisible by b. This implies that 
\B\ < \A n \ and completes the proof of part (i) of Theorem 3.1. 

PROOF of Part (ii) : We divide the proof into two cases. 
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Case 1 : 2 < c < b. 



Fix n > and an /^-avoiding subset B of [l,n]. We must show that 
< \A' n \. If -B contains no multiples of b we are done, so suppose the 
contrary. Let z = bz\ be the largest element of B which is a multiple of b. 
Then it suffices to produce at least z\ numbers in the interval [l,z] which 
are not in B. Since B avoids £, it contains no solutions to the equation 

x + y = czi . 

Thus B contains no more than of the numbers in the interval [1, cz±]. 
But since 2 < c < b, it follows that z\ < [^-j and cz\ < z. Thus we are 
done. 

Case II : b < c < 2b. 

We proceed by induction on n. The theorem obviously holds for n = 1. 
Now fix n > 1 and an /^-avoiding subset B of [l,n]. If B contains no 
multiples of b then we are done, so we may assume that B contains some 
such elements. 

First suppose there exists zeBfl (|n, n] which is a multiple of b. Let 
z = bz\. Then cz\ G (n, |n] C (n,2n]. To simplify notation, set cz\ := t\ 
and ti — n := n — s\ + 1 . Since B avoids £, it contains no solutions to the 
equation 

x + y = ti. 

The map / : x \— > t\ — x is a 1-1 mapping from the interval I\ = [n — si + l,n] 
to itself and for each x G 7i at most one of the numbers x and /(x) lies in 
5. Thus 

| Bn h | < [s/2j. 

Then clearly (since b > 2) the induction argument implies that |5| < \A' n \. 

Thus we may assume that B n (-n,n] contains no multiples of b. But 
then another application of the induction hypothesis yields that \B\ < \A' n \ 
in this case too. 

Thus part (ii) of the theorem is proved. 
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Proof of Part (hi) : The argument is similar to that in Case I of part 
(ii). Let B be an ^-avoiding subset of [l,n]. If B contains no multiples of 
b, then clearly \B\ < \A'^\. So suppose z = bz\ is the largest multiple of b in 
B. Then 

71 — 7 

\Bn(z,n]\<(n-z)-[— r \. (26) 
Since B avoids C, it contains no solutions to the equation 

x + y = zi. 

Thus 

\Bn[l,z]\< z -[^\= z -l±\. (27) 

Clearly, (34) and (35) imply that \B\ < \A'^\ (with strict inequality unless 
zi=[n/b\). 

This completes the proof of Theorem 3.1. 

Concluding remark It is worthwhile to investigate if the proof of The- 
orem 3.1 can be used to obtain a stronger result, namely a classification 
of the extremal sets. We choose not to go into this matter in this paper, 
which we think already contains enough in the way of detailed, technical 
computations. In any case, the important thing is the 'hybrid' nature of the 
extremal sets in part (i) of the theorem. Note that, for each b > 2, the sets 
A n have strictly greater asymptotic density than any sets of type II'. This 
follows from (30), Lemma 1(b) of [4] and a straightforward computation. 
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